Hidden Representation after Reinforcement Learning of Hand Reaching Movement with Variable Link Length

نویسندگان

  • Katsunari Shibata
  • Koji Ito
چکیده

Iriki et al. reported interesting results regarding the visual receptive field of two kinds of neurons in the parietal cortex of a monkey. A monkey did a task to reach its hand or tool to a target. The receptive field of one kind of neuron was enlarged when the monkey used the tool grasped by its hand. The receptive field of the other type of neurons moved together with its hand even though the hand was hidden under an opaque plate. They discussed those results in relation to high-order cognitive functions such as body image and symbolization[1]-[3]. In this paper, a hypothesis is posited that these neurons contribute to generate the critic output (state evaluation in a given task) and are obtained through reinforcement learning. Thereby, tool use is considered to be the change of link length for simplicity; a layered neural network learns hand reaching by a manipulator based on reinforcement learning. Inputs of the network are visual sensory signals and the state of the manipulator. Outputs are the critic and joint torques as the actor. After learning, the manipulator came to move its hand toward the target on the visual sensor when the target was located within the hand’s reach. Both types of neurons observed in experiments of Iriki et al. were found in the hidden layer of the neural network.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Acquisition of Reaching Movement Based on Reinforcement Learning

A learning system with a neural network whose inputs are visual sensory signals and state of a manipulator, and whose outputs are joint torques, obtains the hand reaching movement by reinforcement learning. This paper shows some simulation results in the case of (1)variable link length and in the case of (2)external force which is loaded at the hand. In the former, a hidden neuron's representat...

متن کامل

Instance-Based Utile Distinctions for Reinforcement Learning with Hidden State

We present Utile Suffix Memory, a reinforcement learning algorithm that uses short-term memory to overcome the state aliasing that results from hidden state. By combining the advantages of previous work in instance-based (or “memorybased”) learning and previous work with statistical tests for separating noise from task structure, the method learns quickly, creates only as much memory as needed ...

متن کامل

Combination of Reinforcement Learning and Dynamic Self Organizing Map for Robot Arm Control

This paper shows that a system with two link arm can obtain arm reaching movement to a target object by combination of reinforcement learning and dynamic self organizing map. Proposed model in this paper present state and action space of reinforcement learning with dynamis self organizing maps. Because these spaces are continuous. proposed model uses two dynamic self-organizing maps (DSOM) to e...

متن کامل

Somatic and Reinforcement-Based Plasticity in the Initial Stages of Human Motor Learning.

As one learns to dance or play tennis, the desired somatosensory state is typically unknown. Trial and error is important as motor behavior is shaped by successful and unsuccessful movements. As an experimental model, we designed a task in which human participants make reaching movements to a hidden target and receive positive reinforcement when successful. We identified somatic and reinforceme...

متن کامل

Effect of Force Load in Hand Reaching Movement Acquired by Reinforcement Learning

It has been known that when a human moves its hand to a target, the trajectories becomes almost a straight line from the start point to the target. When a viscosity force field is loaded to the hand unexpectedly, it is pulled toward the force direction once and then goes back to the target. However, after the learning in the force field, the trajectory becomes a straight line again, and when th...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1990